Utilizing the Wikidata System to Improve the Quality of Medical Content in Wikipedia in Diverse Languages: A Pilot Study
Abstract
BACKGROUND Wikipedia is an important source of medical information for both patients and medical professionals. Given its wide reach, improving the quality, completeness, and accessibility of medical information on Wikipedia could have a positive impact on global health.
OBJECTIVE We created a prototype implementation of an automated system for keeping drug-drug interaction (DDI) information in Wikipedia up to date with current evidence about clinically significant drug interactions. Our work is based on Wikidata, a novel, graph-based database backend of Wikipedia currently in development.
METHODS We set up an automated process for integrating data from the Office of the National Coordinator for Health Information Technology (ONC) high priority DDI list into Wikidata. We built example implementations demonstrating how the DDI data we introduced into Wikidata could be displayed in Wikipedia articles in diverse languages. Finally, we conducted a pilot analysis to explore whether adding the ONC high priority data would substantially enhance the information currently available on Wikipedia.
RESULTS We derived 1150 unique interactions from the ONC high priority list. Integration of the potential DDI data from Wikidata into Wikipedia articles proved to be straightforward and yielded useful results. We found that even though the majority of current English Wikipedia articles about pharmaceuticals contained sections detailing contraindications, only a small fraction of articles explicitly mentioned interaction partners from the ONC high priority list. For 91.30% (1050/1150) of the interaction pairs we tested, neither of the 2 articles corresponding to the interacting substances explicitly mentioned the interaction partner. For 7.21% (83/1150) of the pairs, only 1 of the 2 associated Wikipedia articles mentioned the interaction partner; for only 1.48% (17/1150) of the pairs, both articles contained explicit mentions of the interaction partner.
CONCLUSIONS Our prototype demonstrated that automated updating of medical content in Wikipedia through Wikidata is a viable option, although further refinements and community-wide consensus building are required before integration into public Wikipedia is possible. A long-term endeavor to improve the medical information in Wikipedia through structured data representation and automated workflows might lead to a significant improvement in the quality of medical information in one of the world's most popular Web resources.
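The pilot analysis in the RESULTS can be pictured as a simple tally: for each interaction pair, count how many of the 2 corresponding articles mention the other substance (0, 1, or 2). The Python sketch below is only an illustration under stated assumptions. The function name `count_partner_mentions`, the plain case-insensitive substring match, and the placeholder drug names and article texts are all hypothetical; the abstract does not describe how mentions were actually detected.

```python
from collections import Counter


def count_partner_mentions(pairs, articles):
    """Tally pairs by how many of the 2 articles mention the partner (0, 1, or 2).

    Assumption: a "mention" is a case-insensitive substring match on the
    partner's name. The study's real matching method is not given in the abstract.
    """
    tally = Counter()
    for drug_a, drug_b in pairs:
        text_a = articles.get(drug_a, "").lower()
        text_b = articles.get(drug_b, "").lower()
        mentions = int(drug_b.lower() in text_a) + int(drug_a.lower() in text_b)
        tally[mentions] += 1
    return tally


# Hypothetical placeholder data; these are not entries from the ONC list.
articles = {
    "alphadrug": "Alphadrug should not be combined with betadrug.",
    "betadrug": "Betadrug has a known interaction with alphadrug.",
    "gammadrug": "Gammadrug is taken orally.",
    "deltadrug": "Deltadrug raises plasma levels of gammadrug.",
    "epsidrug": "Epsidrug is an analgesic.",
    "zetadrug": "Zetadrug is an antidepressant.",
}
pairs = [
    ("alphadrug", "betadrug"),   # both articles mention the partner
    ("gammadrug", "deltadrug"),  # only one article mentions the partner
    ("epsidrug", "zetadrug"),    # neither article mentions the partner
]

tally = count_partner_mentions(pairs, articles)
for mentions in (0, 1, 2):
    share = 100 * tally[mentions] / len(pairs)
    print(f"{mentions} of 2 articles mention the partner: {tally[mentions]} pairs ({share:.2f}%)")
```

Substring matching is the simplest possible choice and would miss synonyms, brand names, and drug-class mentions, so a real analysis would likely need a more careful matching step.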
Similar resources
Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata
While Wikipedia exists in 287 languages, its content is unevenly distributed among them. In this work, we investigate the generation of open domain Wikipedia summaries in underserved languages using structured data from Wikidata. To this end, we propose a neural network architecture equipped with copy actions that learns to generate single-sentence and comprehensible textual summaries from Wiki...
The 5th ISCB Wikipedia Competition: Coming to a Classroom Near You?
The International Society for Computational Biology (ISCB) is pleased to announce the 5th ISCB Wikipedia Competition. The competition has been run annually since 2012 and awards students and trainees for the best contributions to computational biology-related articles [1]. ISCB runs the competition in collaboration with WikiProject Computational Biology, a group of around 130 editors overseeing...
Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore
Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...
Wikidata: A Platform for Data Integration and Dissemination for the Life Sciences and Beyond
Wikidata is an open, Semantic Web-compatible database that anyone can edit. This ‘data commons’ provides structured data for Wikipedia articles and other applications. Every article on Wikipedia has a hyperlink to an editable item in this database. This unique connection to the world’s largest community of volunteer knowledge editors could help make Wikidata a key hub within the greater Semanti...
A Comparative Study of the Nursing Undergraduate Program in Iran and Alice Lee University in Singapore
Introduction: Curriculum is the heart of any educational program and a key element of higher education for transferring knowledge, attitude, and skills to students. Comparing different educational systems helps to improve the content and quality of an educational program. The aim of this study was to compare the nursing program in Iran and Alice Lee University in Singapo...